Coder Reliability and Misclassification in the Human Coding of Party Manifestos

نویسندگان

  • Slava Mikhaylov
  • Michael Laver
  • Michael Alvarez
چکیده

The Comparative Manifesto Project (CMP) provides the only time series of estimated party policy positions in political science and has been extensively used in a wide variety of applications. Recent work (e.g., Benoit, Laver, and Mikhaylov 2009; Klingemann et al. 2006) focuses on nonsystematic sources of error in these estimates that arise from the text generation process. Our concern here, by contrast, is with error that arises during the text coding process since nearly all manifestos are coded only once by a single coder. First, we discuss reliability and misclassification in the context of hand-coded content analysis methods. Second, we report results of a coding experiment that used trained human coders to code sample manifestos provided by the CMP, allowing us to estimate the reliability of both coders and coding categories. Third, we compare our test codings to the published CMP “gold standard” codings of the test documents to assess accuracy and produce empirical estimates of a misclassification matrix for each coding category. Finally, we demonstrate the effect of coding misclassification on the CMP’s most widely used index, its left–right scale. Our findings indicate that misclassification is a serious and systemic problem with the current CMP data set and coding process, suggesting the CMP scheme should be significantly simplified to address reliability issues.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Coder Reliability and Misclassification in Comparative Manifesto Project Codings∗

The long time series of estimated party policy positions generated by the Comparative Manifesto Project (CMP) is the only such time series available to the profession and has been extensively used in a wide variety of applications. Recent work (e.g. Benoit, Laver, and Mikhaylov 2007; Klingemann et. al. 2006, chs. 4–5) focuses on non-systematic sources of error in these estimates that arise from...

متن کامل

On the Representation of Bloom's Revised Taxonomy in Interchange Coursebooks

This study intends to evaluate Interchange series (2005), which are still fundamental coursebooks in the EFL curriculum settings, in terms of learning objectives in Bloom’s Revised Taxonomy (2001) to see which levels of Bloom's Revised Taxonomy were more emphasized in these coursebooks. For this purpose, the contents of Interchange textbooks were codified based on a coding scheme designed by th...

متن کامل

Designing a model for upgrade the productivity of human resources in the National Iranian Oil Company

The purpose of this research is to design a model system for upgrade the productivity of human resources in the NIOC‌. This research, in terms of philosophical foundations of research in the paradigm of interpretiveness, from the perspective of purpose is part of exploratory research and the method of performing the work qualitatively; From the research strategy of data foundation theorizing wi...

متن کامل

Treating Words as Data with Error: Uncertainty in Text Statements of Policy Positions

Political text offers extraordinary potential as a source of information about the policy positions of political actors. Despite recent advances in computational text analysis, human interpretative coding of text remains an important source of text-based data, ultimately required to validate more automatic techniques. The profession’s main source of cross-national, time-series data on party pol...

متن کامل

Investigating the Predominant Levels of Learning Objectives in General English Books

This study investigated nine General English books (five produced by non-native Iranian speakers and four produced by native speakers) in terms of learning objectives in Bloom’s Revised Taxonomy (2001). The aim was to find out which levels of Bloom’s Revised Taxonomy are dominant in the books. So, the contents of the books were codified based on a coding scheme designed by Razmjoo and Kazempurf...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010